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The recent effort by The Cancer Genome Atlas (TCGA) Network 
has revealed that gastric cancer, which is a leading cause of cancer- 
related deaths worldwide with a 5-year survival rate less than 25%, 
is a much more heterogeneous disease than previously thought. 
And yet, conventional treatment approaches and clinical trials have 
assumed it is a single disease. Although it is well known that under 
the microscope, gastric cancer cells appear quite different, the 
current classification scheme recognizes two main categories of 
gastric cancer: diffuse and intestinal. 

By integrating large-scale measurements of DNA, RNA, and 
proteins from 295 primary gastric tumors, the TCGA project found 
that there are four major subtypes of gastric cancer. The fact that 
these subtypes appear quite distinct on the molecular level suggests 
not only that the molecular processes driving tumorigenesis can 
vary among patients but also that the treatments may have to 
be tailored based on the subtype of tumor a patient has. This 
molecular stratification of patients, enabled by large-scale molecular 
characterization, is a significant step towards personalized therapy. 

The data that are generated from tumors in the TCGA project 
are not only very large in terms of volume, but, perhaps even more 
importantly, highly heterogeneous. Six different molecular analysis 
technologies were used for the molecular characterization of 
gastric cancer. These technologies provided information on DNA 
mutations, amplifications or deletions of gene sequences, epigenetic 
modifications of DNA, and levels of mRNA, microRNA, and proteins. 
Additionally, coded clinical information on pathology, histology, 
tumor characteristics, and other relevant data on each patient was 
collected. 

The computational challenge is how to integrate all this 
information not only to see broad differences among tumors but also 
to identify strong statistical associations among all these molecular 
and clinical data. Such associations may provide clues to how 
molecular systems in cancer cells are disrupted in different subtypes 
of gastric cancer and what treatment strategies may be most 
effective. 

For example, one of the four subtypes is characterized by the 
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presence of the Epstein-Barr virus (EBV). Patients having tumors 
of this subtype tend to also have mutations in the PIK3CA pathway, 
extreme DNA hypermethylation, and extra copies of PD-L1 and 
PD-L2 genes, which are suppressors of immune response. These 
findings suggest that inhibitors of the PI3-K pathway may be of 
potential use for this subtype of gastric cancer. Further, PD-L1/2 
antagonists may help promote immune destruction of the tumor. 

Another example of a statistical association that may have 
clinical use is the finding that frequent mutations in a gene called 
RHOA occur predominantly in the subtype of gastric cancer termed 
"genomically stable," which is characterized by the lack of high 
levels of aneuploidy and are predominantly diffuse-type tumors. 
These subtypes of tumors have a higher likelihood of metastasis 
and are more invasive. It is known that the gene product of RHOA 
interacts with other proteins to alter cellular shape and motility, which 
are important for tumor growth. This protein or pathway may be an 
important therapeutic target for the genomically stable subtype of 
gastric cancer. 

The large-scale statistical analysis of the data revealed 
numerous other prevalent associations among factors, such as 
age at diagnosis, anatomic region of the stomach where the tumor 
occurs, gender, mutation status of genes such as TP53, and 
other tumor or patient characteristics. Although identifying such 
relationships in heterogeneous cancer data is important for the initial 
discovery process, much work lies ahead to functionally characterize 
the molecular aberrations in cancer and to develop and test effective 
targeted therapies. 

The ever-increasing volume and complexity of molecular 
data from cancer, exemplified by the TCGA project, calls for more 
sophisticated analytical approaches that can detect multivariate 
relationships in heterogeneous data and map that information onto 
the existing body of knowledge of the structure of molecular networks 
that govern all cellular processes. Other large-scale molecular 
characterization projects, such as the ENCyclopedia Of DNA 
Elements (ENCODE) and the Genomics of Drug Sensitivity in Cancer, 
are producing highly complementary data that will allow scientists to 
paint a more complete picture of how molecular regulatory processes 
are disrupted in cancer cells and what drugs may be effective in 
targeting a cancer's weaknesses. This first large-scale molecular 
characterization and analysis of gastric cancer undertaken by the 
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TCGA project takes us one step closer toward helping patients suffering from this deadly type of cancer. 
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